Affective classification of generic audio clips using regression models
Authors
Abstract
We investigate acoustic modeling, feature extraction and feature selection for the problem of affective content recognition of generic, non-speech, non-music sounds. We annotate and analyze a database of generic sounds containing a subset of the BBC sound effects library. We use regression models, long-term features and wrapper-based feature selection to model affect in the continuous 3-D (arousal, valence, dominance) emotional space. Frame-level features are extracted from each audio clip and combined with functionals to estimate long-term temporal patterns over the duration of the clip. Experimental results show that the regression models achieve categorical performance similar to that of the more popular Gaussian Mixture Models. They are also capable of predicting accurate affective ratings on continuous scales, achieving 62-67% 3-class accuracy and 0.69-0.75 correlation with human ratings, higher than comparable numbers in the literature.
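The pipeline described in the abstract (frame-level features, functionals over the clip, a regression model, then evaluation by correlation and 3-class accuracy) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation. The choice of functionals (mean, standard deviation, minimum, maximum), the use of closed-form ridge regression, the tercile thresholds for the three classes, and the synthetic data are all assumptions made for the example. Wrapper-based feature selection is omitted for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)

def clip_functionals(frames):
    """Summarize a (n_frames, n_features) matrix with per-feature statistics.

    The four functionals used here are an assumption for illustration.
    """
    stats = [frames.mean(axis=0), frames.std(axis=0),
             frames.min(axis=0), frames.max(axis=0)]
    return np.concatenate(stats)

def fit_ridge(X, y, alpha=1.0):
    """Closed-form ridge regression with an unpenalized intercept."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc = X - x_mean
    gram = Xc.T @ Xc + alpha * np.eye(X.shape[1])
    w = np.linalg.solve(gram, Xc.T @ (y - y_mean))
    b = y_mean - x_mean @ w
    return w, b

def to_three_classes(ratings, low, high):
    """Map continuous ratings to 0 (low), 1 (neutral), 2 (high)."""
    return np.digitize(ratings, [low, high])

# Synthetic stand-in for annotated clips (hypothetical data, not the BBC set):
# each clip has a latent per-feature offset that drives its "arousal" rating.
n_clips, n_feats = 200, 6
true_w = np.array([1.0, -0.5, 0.3, 0.0, 0.0, 0.8])
rows, ratings = [], []
for _ in range(n_clips):
    mu = rng.normal(size=n_feats)
    n_frames = int(rng.integers(50, 150))  # clips differ in length
    frames = mu + 0.5 * rng.normal(size=(n_frames, n_feats))
    rows.append(clip_functionals(frames))  # fixed-length clip descriptor
    ratings.append(mu @ true_w + 0.1 * rng.normal())
X, y = np.vstack(rows), np.array(ratings)

# Simple hold-out split: first 150 clips for training, the rest for testing.
X_train, X_test, y_train, y_test = X[:150], X[150:], y[:150], y[150:]
w, b = fit_ridge(X_train, y_train)
pred = X_test @ w + b

# Continuous evaluation: Pearson correlation with the reference ratings.
corr = np.corrcoef(pred, y_test)[0, 1]

# Categorical evaluation: bin predictions and references at training terciles.
low, high = np.quantile(y_train, [1 / 3, 2 / 3])
acc = np.mean(to_three_classes(pred, low, high)
              == to_three_classes(y_test, low, high))
print(f"correlation={corr:.2f}, 3-class accuracy={acc:.2f}")
```

The functionals turn clips of different lengths into descriptors of the same size, which is what lets a standard regressor be applied at the clip level. The same regression output is scored both ways: directly by correlation, and after thresholding by 3-class accuracy.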
Similar resources
Understanding Affective Content of Music Videos through Learned Representations
In consideration of the ever-growing available multimedia data, annotating multimedia content automatically with feeling(s) expected to arise in users is a challenging problem. In order to solve this problem, the emerging research field of video affective analysis aims at exploiting human emotions. In this field where no dominant feature representation has emerged yet, choosing discriminative f...
Predicting Affect in Music Using Regression Methods on Low Level Features
Music has been shown to impact the affective states of the listener. The emotion in music task at the MediaEval challenge 2015 focuses on predicting the affective dimensions of valence and arousal in music using low level features. In particular, this edition of the challenge involves prediction on full length songs given a training set containing smaller 30 second clips. We approach the proble...
Automatic Affective State Recognition Based on Physiological Changes
Recently, automatic affective state recognition has attracted attention for improving Human Computer Interaction (HCI), clinical research and various other applications. Little attention has been paid so far to physiological signals for affective state recognition compared to audio-visual methods. Different affective states stimulate the Autonomic Nervous System (ANS) and lead to changes in physi...
Audio-Video based Classification using SVM and AANN
This paper presents a method to classify audio-video data into one of five classes: advertisement, cartoon, news, movie and songs. Automatic audio-video classification is very useful for audio-video indexing and content-based audio-video retrieval. Mel frequency cepstral coefficients are used to characterize the audio data. The color histogram features extracted from the images in the video clips a...
Content-based audio classification and retrieval using a fuzzy logic system: towards multimedia search engines
In recent years, the available audio corpora have been growing rapidly, driven by the fast-growing Internet and digital libraries. How to classify and retrieve sound files relevant to the user’s interest from large databases is crucial for building multimedia web search engines. In this paper, content-based technology has been applied to classify and retrieve audio clips using a fuzzy logic system, which is int...
Publication date: 2013